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BACTERIAL SMALL - MOLECULE THREE -HYBRID SYSTEM 



This application claims priority of U.S. Serial No. 10/132,039, 
filed April 24, 2002, the contents of which are herein 
incorporated by reference into this application. 

This invention has been made with government support under 
National Institutes of Health grant GM62867. Accordingly, the 
U.S. Government has certain rights in the invention. 

Throughout this application, various publications are referenced 
by Arabic numerals in parentheses. Full citations for these 
publications may be found at the end of the specification 
immediately preceding the claims. The disclosures of these 
publications in their entireties are hereby incorporated by 
reference into this application in order to more fully describe 
the state of the art as known to those skilled therein as of the 
date of the invention described and claimed herein. 

Field of Invention 

This invention relates to the field of identifying protein 
targets and their corresponding small -molecule drugs and other 
biomolecules using the techniques of chemically induced 
dimerization ("CID"). 

Background of t H*> Invention 

Affinity chromatography has long been used to identify the 
protein targets of small -molecule drugs and other biomolecules. 
While an essential tool for biochemical research, affinity 
chromatography can often be labor intensive and time consuming. 
Recently the yeast three-hybrid assay, a derivative of the two- 
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hybrid assay, was introduce a straightforward, in vivo 

alternative to affinity chromatography (1,2). The yeast two- 
hybrid system relies on the interaction of two fusion proteins 
to bring about the transcriptional activation of a reporter gene 

5 thus identifying protein-protein interactions in an in vivo 
system (2). The subsequently developed yeast three-hybrid 
system screens for a small molecule-protein interaction based on 
the principle that small ligand -receptor interactions underlie 
many fundamental processes in biology and form the basis for 

10 pharmacological intervention of human diseases in medicine (3) . 
In the three-hybrid assay, protein- small -molecule interactions 
are detected as reconstitution of a transcriptional activator 
("TA") and subsequent transcription of a reporter gene (4-7) . 
A dimeric small -molecule ligand bridges the DNA-binding domain 

15 ( "DBD" ) of the TA, which is fused to the receptor for one 
ligand, and the activation domain ( "AD" ) of the TA, which is 
fused to the receptor for the other ligand. For affinity 
chromatography applications, one ligand- receptor pair is used as 
an anchor and the other is the small -molecule-protein 

20 interaction being investigated. While the yeast three-hybrid 
assay is quite powerful, a bacterial equivalent would increase 
the number of proteins that could be tested by several orders of 
magnitude because the transformation efficiency and doubling 
time of E . coli is significantly greater than that of S, 

25 cerevisiae. In addition, there may be applications where it is 
advantageous to test a eukaryotic protein in a .prokaryotic 
environment where many pathways are not conserved. 

•However, the yeast three-hybrid assay cannot be transferred 
30 directly to bacteria. The components of the transcription 
machinery and the mechanism of transcriptional activation differ 
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signif icantly between bacteri yeast. Ligand- receptor pairs 

often are organism specific because of cell permeability, 
toxicity, or other interactions with the cellular milieu. 
Bacterial two-hybrid assays have only begun to be developed in 
5 the past few years (8) and to date only initial- efforts toward 
the design of a robust bacterial three-hybrid system have been 
reported (9, 10). Described below is the first robust small- 
molecule bacterial three-hybrid system. 
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Summary of the Invention 

This invention provides a transgenic bacterial cell comprising 

(a) a dimeric small molecule which comprises a first moiety 
known to bind a first receptor domain covalently linked to 

5 a second moiety capable of binding a second receptor 

domain, wherein the first and second moieties are 
different ; 

(b) nucleotide sequences which upon transcription encode 

i) a first fusion protein comprising the first 
10 receptor domain, and 

ii) a second fusion protein comprising the second 
receptor domain; and 

(c) a reporter gene wherein expression of the reporter 
gene is conditioned on the proximity of the first fusion 

15 protein to the second fusion protein. 

This invention also provides a method for identifying a molecule 
that binds a known target receptor in a bacterial cell from a 
pool of candidate molecules, comprising: 
20 (a) forming a dimeric molecule by covalently bonding each 

molecule in the pool of candidate molecules to a ligand 
capable of selectively binding to a receptor ; 
(b) introducing the dimeric molecule into a bacterial cell 
culture comprising bacterial cells that express a first 
25 fusion protein which comprises the known target receptor 

domain against which the candidate molecule is screened, a 
second fusion protein which comprises the receptor domain 
to which the ligand selectively binds, and a reporter gene 
wherein expression of the reporter gene is conditioned on 
30 the proximity of the first fusion protein to the second 

fusion protein; 
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(c) permitting the dime )lecule to bind to the first 
fusion protein and to the second fusion protein, bringing 
the two fusion proteins into proximity so as to activate 
the expression of the reporter gene; 

(d) selecting the bacterial cell that expresses the 
reporter gene; and 

(e) identifying the small molecule that binds the known 
target receptor . 

This invention further provides a method for identifying an 
unknown target receptor to which a known molecule is capable of 
binding in a bacterial cell, comprising: 

(a) providing a dimeric molecule having a first ligand 
which has a specificity for the unknown target receptor 
covalently bonded to a second ligand capable of selectively 
binding to a receptor; 

(b) introducing the dimeric molecule into a bacterial cell 
which expresses a first fusion protein which comprises the 
unknown target receptor domain, a second fusion protein 
which comprises the receptor domain to which the second 
ligand selectively binds, and a reporter gene wherein 
expression of the reporter gene is conditioned on the 
proximity of the first fusion protein to the second fusion 
protein; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein and to the second fusion protein so as to 
activate the expression of the reporter gene; 

(d) selecting which bacterial cell expresses the unknown 
target receptor; and 

(e) identifying the unknown target receptor. 
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This invention also provic transgenic bacterial cell 

comprising 

(a) a dimeric small molecule which comprises a methotrexate 
moiety covalently linked to a moiety capable of binding a 
receptor domain; 

(b) nucleotide sequences which upon transcription encode 

i) a first fusion protein comprising a DHFR domain and 
a first fragment of an enzyme , v and 

ii) a second fusion protein comprising the receptor 
domain and a second fragment of the enzyme, wherein 
activity of the enzyme is conditioned on the proximity 
of the first fragment of the enzyme to the second 
fragment of the enzyme. 

This invention further provides a method for identifying a 
molecule that binds a known target receptor in a bacterial cell 
from a pool of candidate molecules, comprising: 

(a) forming a dimeric molecule by covalently bonding each 
molecule in the pool of candidate molecules to a 
methotrexate moiety; 

(b) introducing the dimeric molecule into a bacterial cell 
culture comprising bacterial cells that express a first 
fusion protein which comprises the known target receptor 
domain against which the candidate molecule is screened, 
and a first fragment of an enzyme, and a second fusion 
protein which comprises a DHFR receptor domain and a second 
fragment of the enzyme; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein arid to the second fusion protein, bringing 
the first fragment and the second fragment of the enzyme in 
to proximity so as to reconstitute the activity of the 
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enzyme ; 

(d) selecting the bacterial cell that exhibits the activity 
of the enzyme; and 

(e) identifying the small molecule that binds the known 
5 target receptor. 

This invention also provides a method for identifying an unknown 
target receptor to which a known molecule is capable of binding 
in a bacterial cell, comprising: 
10 (a) providing a dimeric molecule having a first ligand 

which has a specificity for the unknown • target receptor 
covalently bonded to a methotrexate moiety; 

(b) introducing the dimeric molecule into a bacterial cell 
which expresses a first fusion protein which comprises the 

15 unknown target receptor domain, and a first fragment of an 

enzyme, and a second fusion protein which comprises a DHFR 
receptor domain and a second fragment of the enzyme; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein and to the second fusion protein, bringing 

20 the first fragment and the second fragment of the enzyme in 

to proximity so as to reconstitute the activity of the 
enzyme ; 

(d) selecting the bacterial cell that exhibits the activity 
of the enzyme ; and 

25 (e) identifying the unknown target receptor. 

Also provided by this invention is are kits for carrying out the 
methods . 
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Description of the Fig-ares 

Figure 1. Bacterial RNA polymerase two- and three-hybrid 
systems. (A) Dove and Hochschild built a bacterial two-hybrid 
system based on their observation that kcl -mediated recruitment 
of RNA polymerase is sufficient to activate gene transcription. 
The DNA operator of Xcl is placed upstream of the RNA polymerase 
binding site for a lacZ reporter gene. Using the known 
interaction between the proteins, Gal4 and Galll p , as a proof of 
principle, they showed that co-expression of A,cl-Gal4 and aNTD- 
Galll p fusion proteins was sufficient to activate transcription 
of the lacZ gene. XcZ binds to its operator and through the 
binding of Gal4 and Galll p is effectively associated with cxNTD . 
ocNTD is the N- terminal domain of the a-subunit of RNA polymerase 
and is used to localize the entire RNA polymerase machinery to 
the RNA polymerase binding site and thus promote transcription 
of the JacZ gene. (B) In our bacterial three-hybrid system, the 
interaction between lei and aNTD is provided by a heterodimeric 
small molecule, Mtx-SLF . Mtx-SLF bridges between the fusion 
proteins, XcI-FKBP12 (the receptor for SLF) and aNTD-DHFR (the 
receptor for Mtx) , activating transcription of the lacZ reporter 
. gene . 

Figure 2. Retrosynthetic analysis of Mtx-SLF. 

Figure 3. p-Galactosidase assays establish that transcriptional 
activation is small -molecule dependent in the bacterial three- 
hybrid system. X-Gal indicator plates are shown for the 
different constructs used to verify the three-hybrid system. 
Each column corresponds to strain V674E bearing plasmids 
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expressing Xcl and aNTD fusiw.. proteins involved in the RNA 
polymerase two- and three-hybrid assays: 1, A.cl-Gal4, aNTD- 
Galll p ; 2, Xcl , aNTD; 3 , >.d -FKBP12 , aNTD; 4, A,cI-FKBP12, aNTD- 
DHFR; 5, Xcl , aNTD-DHFR . 1 is the Gal4-Galll p dii'ect protein- 

5 protein interaction used as a positive control. 2, 3, and 5 
lack both of the necessary receptor proteins to test the 
necessity of all three components for transcriptional 
activation. Plate A contains no Mtx-SLF; and plate B, 10 |iM 
Mtx-SLF. The plates are LB solid media containing 40 mg/mL X- 

10 Gal, 0.5 mM IPTG, 0 . 5 mM tPEG, 100 mg/mL ampicillin, and 6 mg/mL 
chloramphenicol . 

Figure 4. The levels of small -molecule induced transcriptional 
• activation were quantified, using liquid lacZ assays. The 
15 strains here correspond exactly to those in Figure 2 and are 
used in liquid ONPG assays where the levels of transcriptional 
activation can be quantitated based on the amount of reporter 
protein, b-galactosidase , which is produced. The strains are 
V674E bearing plasmids encoding Xcl and aNTD fusion proteins: 1, 
20 Jlcl-Gal4, oNTD-Galll p ; 2, Xcl, aNTD; 3, X,cI-FKBP12, aNTD; 4, Xcl- 
FKBP12, aNTD-DHFR; 5, Xcl, aNTD-DHFR. The strains were assayed 
in triplicate from three transf ormants and standard deviations 
are shown. The strains are grown in LB with 0.5 mM IPTG, 100 
^lg/mL ampicillin, 6 Kig/mL chloramphenicol, and small molecules 
25 at the indicated concentration. 1 is the Gal4-Galll p direct 
protein-protein interaction Mtx-SLF independent positive 
control, only 4 contains both receptor proteins for Mtx-SLF as 
the other strains lack either one or both of the receptor 
proteins. The last two small -molecule concentrations are 
30 conpetition assays in whioh an excess of one of the ligands for 
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the receptor proteins was u 



o compete out the positive 



signal due to the three-hybrid system. 

Figure 5. The levels of transcriptional activation depend on 
5 Mtx-SLF concentration in the bacterial three-hybrid system. The 
concentrations of Mtx-SLF in the media were varied, and the 
levels of lacZ transcription, were quantitated in liquid culture 
using ONPG. The strains are V674E expressing the following A.cl 
and cxNTD fusion proteins: (♦) , X,cl-Gal4, aNTD-Galll* (a direct 
10 protein-protein interaction) ; (-0 , A.cl -FKBP12 , aNTD (a negative 
control); and (▲) , A.CI-FKBP12, aNTD-DHFR (the three-hybrid 
system) . The rate of ONPG hydrolysis was measured in triplicate 
from three different transf ormants after growth in LB media 
containing 0.5 mM IPTG, 100 Mg/mL ampicillin, 6 ng/mL 
15 chloramphenicol, and Mtx-SLF at the indicated concentrations. 
The standard deviation for each data point is also shown. 
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Detailed Description of the I ion 

This invention provides a transgenic bacterial cell comprising 

(a) a dimeric small molecule which comprises a first moiety 
5 known to bind a first receptor domain covalently linked to 

a second moiety capable of binding a second receptor 
domain, wherein the first and second moieties are 
different ; 

(b) nucleotide sequences which upon transcription encode 
10 i) a first fusion protein comprising the first 

receptor domain, and 

ii) a second fusion protein comprising the second 
receptor domain; and 

(c) a reporter gene wherein expression of the reporter 
15 gene is conditioned on the proximity of the first fusion 

protein to the second fusion protein. 

The dimeric small molecule may have the formula: 

HI - Y - H2 

20 wherein each of HI and H2 may be the same or different and 
capable of binding to a receptor which is the same or different 
with a IC 50 of less than 100 jxM; and Y is a linker which may be 
present or absent. Each of HI and H2 can be capable of binding 
to a receptor with a IC S0 of less than 10 p.M; or with a IC 50 of 

25 less than 1 jiM; or with a IC 50 of less than 100 nM; or with a IC S0 
of less than 10 nM; or with a IC 50 of less than 1 nM. 

Each of HI and H2 may be a methotrexate moiety, FK506 moiety, an 
FK506 analog, a tetracycline moiety, or a cephera moiety. In a 
30 preferred embodiment, HI or H2 is a methotrexate moiety. In 
another preferred embodiment, HI or H2 is an FK506 analog. The 
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FK506 analog may have the sti a: 



OMe 



OMe 




10 



In a specific embodiment , ' the dimeric small molecule may have 
the structure: 



OMe 



OMe 




HoN N N 



wherein n is an integer from 1 to 20 , preferably n is 
an integer from 2 to 12 , more preferably n is ain integer from 3 
15 to 9, in a specific embodiment, n is 8 . 



20 



The first fusion protein can further comprise a DNA binding 
domain, and the second fusion protein can further comprises a 
transcription activation domain. Alternatively, the first 
fusion protein can further comprises a transcription activation 
domain, and the second fusion protein can further comprises a 
DNA binding domain. 



The transcription activation domain can be aNTD. The DNA 
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binding domain can be X.cl , Ax^w, ^axA, Gal4, or zinc fingers. 

The first ox- the second receptor domain can be that of 
dihydrof olate reductase ( "DHFR" ) , glucocorticoid receptor, 
5 FKBP12, FKBP mutants, tetracycline repressor, or a penicillin 
binding protein. The DHFR can be the E.coli DHFR ( "eDHFR" ) . 

In a specific embodiment, the first fusion protein is DHFR-X-d 
or FKBP12 - A.cl ; and the second fusion protein is DHFR-aNTD or 
10 FKB P 1 2 - cxNTD . 

The reporter gene can be Lac Z, araBAD, aadA (spectinomycin 
resistance), his3, p-lactamase, GFP, luciferase, TetR 
(tetracyclin resistance), KanR (kanamycin resistance) , or Cm 
15 (chloramphenicol resistance). In a specific embodiment, the 
reporter gene is Lac Z. 

This invention also provides a transgenic bacterial cell 
comprising 

20 (a) a dimeric small molecule which comprises a methotrexate 

moiety covalently linked to a moiety capable of binding a 
receptor domain; 

(b) nucleotide sequences which upon transcription encode 

i) a first fusion protein comprising a DHFR domain, 

25 and 

ii) a second fusion protein comprising the receptor 
domain; and 

(c) a reporter gene wherein expression of the reporter 
gene is conditioned on the proximity of the first fusion 

30 protein to the second fusion protein. 
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The elements of this cell arc efined previously. 

This invention also provides a method for identifying a molecule 
that binds a known target receptor in a bacterial cell from a 
pool of candidate molecules, comprising: 

(a) forming a dimeric molecule by cova lent ly bonding each 
molecule in the pool of candidate molecules to a ligand 
capable of selectively binding to a receptor; 

(b) introducing the dimeric molecule into a bacterial cell 
culture comprising bacterial cells that express a first 
fusion protein which comprises the known target receptor 
domain against which the candidate molecule is screened, a 
second fusion protein which comprises the receptor domain 
to which the ligand selectively binds, and a reporter gene 
wherein expression of the reporter gene is conditioned on 
the proximity of the first fusion protein to the second 
fusion protein ; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein and to the second fusion protein, bringing 
the two fusion proteins into proximity so as to activate 
the expression of the reporter gene; 

(d) selecting the bacterial cell that expresses the 
reporter gene; and 

(e) identifying the small molecule that binds the known 
target receptor. 

In the method, the steps (b) - (e) of the method can be 
iteratively repeated in the presence of a preparation of random 
small molecules for competitive binding with the screening 
molecule so as to identify a molecule capable of competitively 
binding the known target receptor. 
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The dimeric molecule in the lod can be obtained from a 

combinatorial library. The dimeric molecule in the method can 
comprise a ligand capable of selectively binding to a receptor 
with a IC 50 of less than 100 |iM; or with a IC 50 of less than 10 
5 fiM; or with a IC 50 of less than 1 fiM,- or with a IC 50 of less than 
100 nM; or with a IC 50 of less than 10 nM; or with a IC56 of less 
than 1 nM . 

The dimeric molecule in the method can comprises a methotrexate 
10 moiety, FK506 moiety, an FK506 analog, a tetracycline moiety , or 
a cephem moiety. In a specific embodiment , the dimeric molecule 
comprises a methotrexate moiety. In another specific 

embodiment, the dimeric molecule comprises an FK506 analog. 

15 The FK506 analog can have has the structure: 



20 




25 The first fusion protein in the method can further comprise a 
DNA binding domain, and the second fusion protein further 
comprises a transcription activation domain. Alternatively, the 
first fusion protein in the method can further comprise a 
transcription activation domain, and the second fusion protein 

30 further comprises a DNA binding domain. 



I 
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The transcription activation ^„.~^n can be aNTD. 

The DNA-binding domain can be A,cl , AraC, LexA, Gal4 , or zinc 
fingers. 

5 

The first or the second fusion protein in the method can 
comprise a receptor domain of dihydrof olate reductase ("DHFR"), 
glucocorticoid receptor, FKBP12, FKBP mutants, tetracycline 
repressor, or a penicillin binding protein. The DHFR can be the 
10 £T.coli DHFR ( "eDHFR" ) . 

In the method, the first fusion protein can be DHFR-XcI or 
FKBP12-A.CI. The second fusion protein can be DHFR-aNTD or 
FKBP12 -aNTD . 

15 

The reporter gene can be .Lac Z, araBAD, aadA, his3 , p-lactamase, 
GFP, lucif erase, TetR, KanR, Cm. In a specific embodiment, the 
reporter gene is Lac Z. 

20 This invention also provides a method for identifying an unknown 
target receptor to which a known molecule is capable of binding 
in a bacterial cell, comprising: 

(a) providing a dimeric molecule having a first ligand 
which has a specificity for the unknown target receptor 

25 covalently bonded to a second ligand capable of selectively 

binding to a receptor; 

(b) introducing the dimeric molecule into a bacterial cell 
which expresses a first fusion protein which comprises the 
unknown target receptor domain, a second fusion protein 

30 which comprises the receptor domain to which the second 
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ligand selectively bin nd a reporter gene wherein 

expression of the reporter gene is conditioned on the 
proximity of the first fusion protein to the second fusion 
protein; 

5 (c) permitting the dimeric molecule to bind to the first 

fusion protein and to* the second fusion protein so as to 
activate the expression of the reporter gene; 
(d) selecting which bacterial cell expresses the unknown 
target receptor; and 
10 (e) identifying the unknown target receptor. 

The unknown target receptor is encoded by a DNA from the group 
consisting of genomicDNA, cDNA and syntheticDNA. 

15 The steps (b). -'(e) of the method can be iteratively repeated so 
as to identify the unkown target receptor. 

The other elements of this method are as defined previously. 

20 This invention also provides a kit for identifying a molecule 
that binds to a known target in a bacterial cell from a pool of 
candidate molecules, comprising 

(a) a host bacterial cell containing a reporter gene that 
is expressed only when bound to a DNA-binding domain and 

25 when in the proximity of a transcription activation domain; 

(b) a first vector containing a promoter that functions in 
the host bacterial cell and a DNA encoding a DNA-binding 
domain ; 

(c) a second vector containing a promoter that functions in 
30 the host bacterial cell and a DNA encoding a transcription 

activation domain; 
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(d) a dimeric small mole tfhich comprises a moiety that 
binds to a known receptor domain covalently linked to a 
candidate molecule; 

(e) a means for inserting into the first vector or the 
5 second vector a DNA encoding the known receptor domain in 

such a manner that the known receptor domain and an 
expression product of the first or second vector are 
expressed as a fusion protein; 

(f) a means for inserting into the first vector or the 
10 second vector a DNA encoding the known target receptor in 

such a manner that the known target receptor and an 
expression product of the first or second vector are 
expressed as a fusion protein; and 

(g) a means for transfecting the host cell with the first 
15 vector, and the second vector, wherein binding of the 

dimeric small molecule to the known target receptor results 
in a measurably greater expression of the reporter gene 
then in the absence of such binding. 

20 The elements of this kit are as defined previously. 

This invention further provides a kit for identifying an unknown 
target receptor to which a molecule is capable of binding in a 
bacterial cell, comprising 
25 (a) a host bacterial cell containing a reporter gene that* 

is expressed only when bound to a DNA- binding domain and 
when in the proximity of a transcription activation domain ; 

(b) a first vector containing a promoter that functions in 
the host bacterial cell and a DNA encoding a DNA-binding 

30 domain ; 

(c) a second vector containing a promoter that functions in 
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the host bacterial cell DNA encoding a transcription 

activation domain; 

(d) a dimeric -small molecule which comprises a moiety that 
binds a known receptor domain covalently linked to a moiety 

5 against which the unknown target is to be screened for 

binding; 

(e) a means for inserting into the first vector or the 
second vector a DNA encoding the known receptor domain in 
such a manner that the known receptor domain and an 

10 expression product of the first or second vector are 

expressed as a fusion protein; 

(f) a means for inserting into the first vector or the 
second vector a DNA encoding the unknown target receptor in 
such a manner that the unknown target receptor and an 

15 expression product of the first or second vector are 

expressed as a fusion protein; and 

(g) a means for transf ecting* the host cell with the first 
vector, and the second vector, wherein binding of the 
dimeric small molecule to the unknown target receptor 

20 results in a measurably greater expression of the reporter 

gene then in the absence of such binding. 

The elements of this kit are as defined previously. 

25 This invention yet further provides a process for screening a 
chemical library for a molecule that binds a known target 
receptor, comprising providing a chemical library/ providing a 
bacterial cell that expresses the known target receptor as a 
part of a fusion protein, and identifying the molecule that 

30 binds the known target receptor in the bacterial cell by the 
method described above. As a result, this invention altso 
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provides a molecule identii oy the method, wherein the 

molecule was not previously known to bind the known target 
receptor . 

5 This invention yet further provides a process for screening a 
cDNA library for a nucleic acid that encodes a receptor to which 
a known molecule binds, comprising providing a cDNA library, 
providing a dimeric molecule having two ligands, of which one • 
ligand is the known molecule, and identifying the nucleic acid 

10 that encodes the receptor to which the known molecule binds by 
the method described above. As a result, this invention also 
provides an isolated nucleic acid identified by the process, 
wherein the isolated nucleic acid was not previously known to 
encode a receptor to which the known molecule binds. The 

15 isolated nucleic acid can encode an enzyme or a portion thereof . 

This invention also provides a transgenic bacterial cell 
comprising 

(a) a dimeric small molecule which comprises a methotrexate 
20 moiety covalently linked to a moiety capable of binding a 

receptor domain; 

(b) nucleotide sequences which upon transcription encode 

i) a first fusion protein comprising a DHFR domain and 
a first fragment of an enzyme, and 
25 ii) a second fusion protein comprising the receptor 

domain and a second fragment of the enzyme, wherein 
activity of the enzyme is conditioned on the proximity 
of the first fragment of the enzyme to the second 
fragment of the enzyme. 

30 



The elements of the cell are as described above. 
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This invention also provides a method for identifying a molecule 
that binds a known target receptor in a bacterial cell from a 
pool of candidate molecules, comprising: 

(a) forming a dimeric molecule by covalently bonding each 
molecule in the pool of candidate molecules to a 
methotrexate moiety ; 

(b) introducing the dimeric molecule into a bacterial cell 
culture comprising bacterial cells that express a first 
fusion protein which comprises the known target receptor 
domain against which the candidate molecule is screened, 
and a first fragment of an enzyme, and a second fusion 
protein which comprises a DHFR receptor domain and a second 
fragment of the enzyme; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein and to the second fusion protein, bringing 
the first fragment and the second fragment of the enzyme in 
to proximity so as to reconstitute the activity of the 
enzyme ; 

(d) selecting the bacterial cell that exhibits the activity 
of the enzyme; and 

(e) identifying the small molecule that binds the known 
target receptor. 

The elements of this method are as defined previously. 

This invention also provides a method for identifying an unknown 
target receptor to which a known molecule is capable of binding 
in a bacterial cell, comprising: 

(a) providing a dimeric molecule having a first ligand 
which has a specificity for the unknown target receptor 
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covalently bonded to a n rexate moiety ; 



(b) introducing the dimeric molecule into a bacterial cell 
which expresses a first fusion protein which comprises the 
unknown target receptor domain, and a first fragment of an 
enzyme, and a second fusion protein which comprises a DHFR 
receptor domain and a second fragment of the enzyme; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein and to the second fusion protein, bringing 
the first fragment and the second fragment of the enzyme in 
to proximity so as to reconstitute the activity of the 
enzyme; 

(d) selecting the bacterial cell that exhibits the activity 
of the enzyme; and 

(e) identifying the unknown target receptor. 

The elements of this method are as defined previously. 

Each of the ligand halves of the dimeric small molecule may be 
derived from a compound selected from the group consisting of 
steroids, hormones, nuclear receptor ligands, cofactors, 
antibiotics, sugars, enzyme inhibitors, and drugs. 

Each of the ligand halves of the dimeric small molecule may also 
represent a compound selected from the group consisting of 
3, 5,3 1 -triiodothyronine, trans-retinoic acid, biotin, 

coumermycin, tetracycline, lactose, methotrexate, FK506, and 
FK506 analogs. 

In the described methods, the screening is performed by 
Fluorescence Associated Cell Sorting (FACS) , or gene 
transcription markers selected from the group consisting of 
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Green Fluorescence Protein, L r . galagctosidases , luciferase, 

antibiotic resistant b-lactamases, and yeast markers. 

The known moiety that binds a known receptor domain can be a 
5 Methotrexate moiety or an analog thereof- The known receptor 
domain can be dihydrof olate reductase ( "DHFR" ) generally, or the 
E . col i DHFR ( u eDHFR" ) . Alternatively, the pairing can be 

FK506/FKBP12 , AP series of synthetic FK506 analogs/FKBPs, 
tetracycline/tetracycline repressor, cephem/penicillin binding 
10 protein. The penicillin binding domain can be from Streptomyces 
R61. 

Each of the methods is readily adapted t6 identify which 
substrate from a pool of candidate substrates is selected in a 
15 cell by a known enzyme for a bond forming reaction between the 
substrate and a known amino acid. 

The described methods, cell and kit may also be adapted to 
identify new protein targets for pharmaceuticals. 

20 

The described methods, cell and kit may also be adapted for 
determining the function of a protein, further including 
screening with a natural cof actor being part of the CID. 

25 The described methods, cell and kit may also be adapted for 
determining the function of a protein, further including 
screening with a natural substrate being part of the CID. 

The described methods, cell and kit may also be adapted for 
30 screening a compound for the ability to inhibit a ligand- 
receptor interaction. 
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The foregoing embodiments < 



subject invention may be 



accomplished according to the guidance which follows. Certain 
of the foregoing embodiments are exemplified. Sufficient 
guidance is provided for a skilled artisan to arrive at all of 
5 the embodiments of the subject invention. 

Preparation and design of 

ligand halves of the dimeric small molecule 

10 A ligand half should bind its receptor with high affinity 100 
nM) , cross cell membranes yet be inert to modification or 
degradation, be available in reasonable quantities, and present 
a convenient side-chain for routine chemical derivatization that 
does not disrupt receptor binding. 



A commercial source of traditional, non-covalent dimeric 
molecules for use in a chemically induced dimerization system is 
ARIAD (www.ariad.com), who call their CID "ARGENT TECHNOLOGY." 
The compounds described herein as well as other commercial 
20 compounds can be derivatized for use in the bacterial three- 
hybrid system. 

For example, methotrexate ("Mtx") is an attractive ligand half 
(also referred to as "chemical handle"). Mtx inhibition of 

25 dihydrof olate reductase (DHFR) is one of the textbook examples 
of high-affinity ligand binding. The interaction between Mtx 
and DHFR is extremely well characterized in the literature both 
biochemically and structurally. DHFR is a monomeric protein and 
binds Mtx with picomolar affinity (11) - Even though Mtx 

30 inhibits DHFR with such high affinity, £?. ooli grows in the 
presence of Mtx when supplemented with appropriate nutrients 
(12). The ability of Mtx to serve both as an antibacterial and 



15 
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an anticancer agent is clea Jence that Mtx has excellent 

pharmacokinetic properties. Mtx is known to be imported into 
cells via a specific folate transporter protein. Mtx is 
commercially available and can be synthesized readily from 
5 simple precursors. Mtx can be modified selectively at its g- 
carboxylase without disrupting its interaction with DHFR (11, 
13). Mtx is commercially available and can be modified 

selectively at its .a-carboxylate without disrupting 
dihydrofolate reductase (DHFR) binding (11 , 13) . Even though 
10 Mtx inhibits DHFR with pM affinity (12) , both E. coli and S. 
cerevisiae grow in the presence of Mtx when supplemented with 
appropriate nutrients (13) . 

However, not all ligand halves, also referred to as chemical 
15 handles, can be readily used in a given organism without the 
need to modify the organism. For instance, dexamethasone 
requires several heat -shock proteins ("HSPs"), in order to be 
bound by the glucocorticoid receptor ("GR") . The required HSPs 
occur in yeast and other eukaryotes; however, since bacteria 

20 lack these HSPis and thus GR fails to bind Dex, Dex based small 
molecules would not readily work in a bacterial three-hybrid 
system. Similarly, using Mtx in bacteria requires special 
adaptation of the bacterial cell as Mtx can actually act as an 
antibiotic, as discussed in more detail in the examples. Mtx is 

25 exported from the bacterial cell by the TolC- dependent multi- 
drug efflux pump (32) . Knocking out the tolC gene allows Mtx- 
based molecules to not only enter the bacterial -cell but remain 
there as well. Furthermore, knocking out thyA removed, the toxic 
effects associated with Mtx by preventing the bio -accumulation 

30 of the toxic dihydrofolate substrate of DHFR (32) . However, as 
discussed in detail in the example, knocking out the thyA, e.g 
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through a mutation, is not i ary at the concentrations of 

Mtx that is used in the described bacterial three-hybrid system. 
While to use Mtx in the exemplified system, one must only knock 
out the tolC gene/protein in order to assure that Mtx-based 
5 molecules remain in the cell, it is clearly possible to also 
knock out thyA for higher concentrations of Mtx. 

Other ligand halves may be, for example, steroids; enzyme 
inhibitors, such as Methotrexate used herein; drugs, such as 
10 FK506 ; hormones, such as the thyroid hormone 3,5,3'- 
triiodothyronine; ligands for nuclear receptors may be retinoic 
acids; general cof actors, such as Blot in; and antibiotics, such 
as Coumermycin (which can be used to induce protein dimerization 
according to Perlmutter et al . , Nature 383, 178 (1996)). 

15 

One, or both, of the ligand halves may also be a moiety that is 
a substrate for an enzyme, i.e. a moiety that would not on its 
j own bind to a protein, but would require an enzyme to assist 
bonding to a protein. In this way, the system can be made 
20 dependent on an enzyme, and would only operate when an enzyme is 
present . Such an enzyme would form a covalent bond between the 
small non-peptide molecule and a protein. This would provide 
flexibility in the system by allowing for one of the non- 
covalent interactions to be replaced by a covalent interaction. 

25 

Similarly, one, or both, of the ligand halves may be a moiety 
that spontaneously seeks out and forms a covalent bond with a 
receptor. An example of this is the interaction between 
Fluorouracil and Thymidylate Synthase, and another between 
30 Cephen and the Penicillin Binding Protein. This would also 
provide flexibility in the system by allowing for one of the 
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non-covalent interactions t replaced by a covalent 

interaction. 

Tnnkaafi of the ligand halves in the dimeric small molecule 
5 While the ligand halves can be simply linked by a covalent bond 
between the two of them, more elaborate linkages may also be 
used depending on the screen to be performed. The linkage may 
be formed by any of the methods known in the art (14,15). 
Descriptions of linkage chemistries are also provided by WO 
10 94/18317, WO 95/02684, WO 96/13613, WO96/06097, and WO 01/53355, 
these references being incorporated herein by reference. The 
linkers are all commercially available or can be prepared in a 
single step. The linkers vary in hydrophobicity , length, and 
flexibility. 

15 The linker may be designed to respond to enzymatic activity. 
For example, a linker can contain a glycosidase bond, which may 
be cleaved by a glycosidase enzyme and formed by a 
Glycosyltransf erase enzyme; or an amide bond, which may be 
cleaved by a protease and formed by peptidase or transpeptidase; 

20 or an aldol product bond, which is cleaved by a retro- aldolase 
and formed by aldolase; or an ester bond; or a phosphodiester 
bond. Such bonds can be used in bacterial based screens 
similarly to their use in yeast based screens, which are 
described in WO 01/53355. 



25 



30 



The enzymes that act on such linkers may be known enzymes or 
novel proteins which are being screened for specific enzymatic 
activity. Novel enzymes can be evolved using combinatorial 
techniques. 

Once a desired substrate is selected and formed into the dimeric 
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small molecule, a large num3 enzymes and derivatives of 

enzymes can be screened. A variety of enzymes and enzymes 
classes are listed on the World Wide Web beginning at 
prowl.rockefeller.edu/enzymes/enzymes.htm. All enzymes are 
5 given an Enzyme Commission (B.C.) number allowing it to be 
uniquely identified. E.C. numbers have four. fields separated by 
periods, "a.b.c.d". The left-hand-most field represents the 
most broad classification for the enzyme. The next field 
represents a finer division of that broad category. The third 

10 field adds more detailed information and the fourth field 
defines the specific enzyme. Thus, in the w a" field the 
classifications are oxidoreductases, transferases, hydrolases, 
lyases, isomerases, and ligases. Each of these "a" 

classifications are then further separated into corresponding 

15 "b" classification, each of which in turn is separated into 
corresponding "c" classifications, which are then further 
separated into corresponding "d" classes. 

Moreover, new enzymes are discovered and are intended to be 
20 included within the scope of this invention, which is itself 
designed to evolve or discover such new enzymes. 

Design of the protein chimeras 

The second important feature is the design of the protein 
25 chimeras. A number of chimeras are discussed in detail in WO 
01/53355. 

The described bacterial three-hybrid system is based on the RNA 
polymerase two-hybrid system reported by Dove and Hochschild in 
30 1997 (16) . A variety of methods for detecting protein-protein 
interactions in bacteria are now available. (8-10, 16-19) . 
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Generally, these methods based either on enzyme 

complementation or transcriptional activation or repression 
assays. While the enzyme complementation assays are essentially 
the same as those used in eukaryotes, entirely new 
5 transcription-based assays had to be developed for bacteria 
because the components of the transcription machinery are poorly 
conserved between eukaryotes and prokaryotes. The choice to 
adapt the RNA polymerase assay developed by Dove and Hochschild 
was because transcriptional activation in this assay results in 

10 a large increase in reporter gene transcription and because 
reconstitution of transcriptional activation was expected to be 
largely conformation independent. Based on their studies of the 
mechanism of transcriptional activation by ^-repressor (Tlcl) (20) , 
Dove and Hochschild developed an in vivo assay for protein- 

15 protein interactions based on dimerization of v kl and the Jtf- 
terminal domain of the a-subunit of RNA polymerase (aNTD) . 
Specifically, they designed a reporter construct consisting of 
a kcl operator upstream of the TATA box for the lacZ gene, which 
encodes p-galactosidase . They then prepared plasmids encoding 

20 a Gal4-kcl fusion protein and a Galll p -oNTD fusion protein. Gal4 
and Galll* are two proteins known to interact with high affinity 
and so provided a well-established test case. Finally, the 
interaction between Gal4-A.cl and Galll p -aNTD was shown to 
activate transcription of the lacZ reporter gene (21) . This 

25 original Gal4-Galll p two-hybrid system is used as a small- 
molecule independent positive control in the example provided 
below. 

pegign of reporter genes 
30 A reporter gene assay measures the activity of a gene's 
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promoter. It takes advantagt olecular biology techniques, 

which allow one to put heterologous genes under the control of 
a bacterial cell. Activation of the promoter induces the 
reporter gene as well as or instead of the endogenous gene. By 

5 design, the reporter gene codes for a protein that can easily be 
detected and measured- Commonly it is an enzyme that converts 
a commercially available substrate into a product. This 
conversion is conveniently followed by either chromatography or 
direct optical measurement and allows for the quantification of 

10 the amount of enzyme produced. 

Reporter genes are commercially available on a variety of 
plasmids for the study of gene regulation in a large variety of 
organisms (22) . Promoters of interest can be inserted into 

15 multiple cloning sites provided for this purpose in front of the 
reporter gene on the plasmid (23, 24) . Standard techniques are 
used to introduce these genes into a cell type or whole organism 
(e.g., as described in Sambrook, J., Fritsch, E.F. and Maniatis, 
T. Expression of cloned genes in cultured mammalian cells. In: 

20 Molecular Cloning, edited by Nolan, C. New York: Cold Spring 
Harbor Laboratory Press, 1989) . Resistance markers provided on 
the plasmid can then be used to select for successfully 
transfected cells. 

25 Ease of use and the large signal amplification make this 
technique increasingly popular ih the study of gene regulation. 
Every step in the cascade DNA --> RNA —> Enzyme — > Product 
Signal amplifies the next one in the sequence. The further down 
in the cascade one measures, the more signal one obtains, 

30 

In an ideal reporter gene assay, the reporter gene under the 
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control of the promoter of i„_„_.jt is transfected into cells, 
either transiently or stably. Receptor activation leads to a 
change in enzyme levels via ti-anscriptional and translational 
events. The amount of enzyme present can be measured via its 
enzymatic action on a substrate. 



Reconstitution of enzvme actjv^v 

In lieu of the protein chimeras or reporter genes as described 
10 above, the reassembly of an enzyme, and thus its activity, can 
be used as a reporter system. Complementation between enzyme 
fragments has been observed in numerous cases since the 
pioneering discovery of intracistronic complementation by 
Ullmann and colleagues in 1967. The so-called S- complement at ion 
15 between two truncated 0-galactosidase polypeptides became a 
classical tool in molecular biology for cloning techniques and 
has recently found new interest in cell biology as a sensitive 
marker of eukaryotic cell fusions and a reporter. 

20 Potentially, a large number of enzymes might be split into two 
complementary fragments that could reassociate t.o reconstitute 
the native enzymatic activity. Functional complementation 
between two enzyme fragments can be exploited to design a three- 
hybrid system provided that: (i) the cognate enzymatic activity 

25 is easily detected or selected for in vivo; (ii) the two 
fragments do not reassociate spontaneously when expressed as 
separate entities (as do, for example, the classical LacZ-S and 
a-acceptor) ; (iii) when fused to interacting polypeptides, the 
two fragments reassociate in an enzymatically active complex. 

30 A variety of enzyme reassembly systems have been described (37) . 
For example, a protein complementation assay based on engineered 
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fragments of mouse dihydrofc reductase (mDHFR) has been 

described (38) . Subsequently, the same group used the mDHFR 
complementation assay to select pairs of leucine zippers that 
selectively heterodimerize . 

5 

One advantage of an enzyme reconstitution assay is that it can 
work in cells which do not endogenously express the enzyme, thus 
providing an efficient reporter system. 

10 This invention will be better understood from the Experimental 
Details which follow. However, one skilled in the art will 
readily appreciate that the specific methods and results 
discussed are merely illustrative of the invention as described 
more fully in the claims which follow thereafter. 



15 
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EXPERI MENTAL DETAILS 

Part I - Preparation of dixneric small molecule liaand 
In order to convert the Dove and Hoschild two-hybrid assay into 
5 a three-hybrid system, it was initially important to design a 
dixneric ligand that could bridge Xcl and aNTD via the ligand' s 
receptors- For the bridging small molecule, a heterodimer of 
methotrexate (Mtx) and a synthetic analog of FK506 (SLF) was 
prepared. This heterodimer is referred to in this description 

10 Mtx- SLF . Mtx- SLF was used to dimerize a A,cI-FK506 binding 
protein 12 (XcI-FKBP12) protein chimera and an aNTD- 
dihydrof olate reductase (aNTD-DHFR) protein chimera as shown in 
Figure 1. Methotrexate (Mtx) inhibits dihydrof olate reductase 
(DHFR) with a low picomolar Kj , and the interaction between the 

15 two has been extensively studied (11, 25) . In addition, our 
laboratory recently showed Mtx could be used successfully in a 
yeast three-hybrid system (7, 26) . For the other half of the 
bridging small molecule, we used SLF, available from Ariad 
Pharmaceuticals as an FK506 analog. SLF has nanomolar affinity 

20 for FKBP12 , and the interaction between the two has been fully 
characterized (27,28) . In addition, SLF homodimers have been 
used previously in several mammalian three -hybrid systems (2 9) . 

The retrosynthetic analysis of Mtx- SLF is shown in Figure 2. 

25 The synthesis is based on previous syntheses of Mtx and SLF 
derivatives and was designed to allow Mtx, SLF, or the linker 
between them to be varied readily. The Mtx portion of the 
molecule begins as the y-methyl ester of L-glutamic acid and is 
based on previous syntheses of Mtx (7,26,30) . y-Methyl l- 

30 glutamic acid is inexpensive, and the a - carboxyl ate can be 
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the Columbia University Ch :y Department NMR Facility. 

Spectra were determined in methanol -d, at 300 K with the proton 
or carbon (3.305; 49.05) as the reference or in chloroform-d at 
3 00K with proton (7.2 65) as the reference. *H NMR spectra are 
5 tabulated in the following order: chemical shift calculated with 
reference to solvent standards based on tetramethylsilane , 
multiplicity (s, singlet; d, doublet; t, triplet; m, multiplet; 
br, broad), coupling constant (s) in Hertz, and number of 
protons. 13 C NMR spectra were determined on the Bruker 300 MHz 

10 instrument and are proton decoupled- Mass spectra (MS) were 
recorded at the Columbia University Department of Chemistry mass 
spectral laboratory. Fast Atom Bombardment (FAB) high 

resolution mass spectra (HRMS) were recorded on a JMS-HX110A 
mass spectrometer. Low resolution electron spray ionization 

15 mass spectra (LRMS) were recorded on a J"MS-LC Mate mass 
spectrometer. Analytical thin layer chromatography (TLC) was 
performed on silica gel (Whatman LHPKF Silica Gel 6 OA) and 
visualized by UV light (254 nm) or stained by ninhydrin. All 
column chromatography was flash chromatography carried out on 

20 silica gel (EM Science Silica Gel 60) , and all eluants used are 
reported in volume : volume ratios. All moisture-sensitive 
reactions were performed under a positive pressure of nitrogen 
in flame- or oven-dried glassware. Organic extracts were dried 
over anhydrous sodium sulfate. Organic solvents were removed in 

25 vacuo with a rotary evaporator equipped with a vacuum pump (ca. 
1 torr) . Products were then left under vacuum (ca. 0.1 torr) 
overnight before analysis was performed. 
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selectively protected as th( :-butyl ester by transiently 

protonating the a-amino group (31) . The diprotected amino acid 
is then coupled to 4 - (methylamino) benzoic acid using standard 
peptide coupling reagents. Finally, the y-methyl ester is 
5 saponified to yield the free acid for further reactions. SLF 
acid was synthesized as described previously from L-pipecolinic 
acid in 59% yield for 6 steps (6,27) . The Mtx and SLF portions 
were then coupled to 1,10-diami nodecane in a three -component 
peptide coupling reaction. 2 , 4 -Diamino-6-bromomethyl-pteridine 

10 is added after this coupling reaction in order to simplify 
purification of the synthetic intermediates. Finally, acid 
cleavage of the tert-butyl ester yielded Mtx- SLF. Thus, the 
Mtx-SLF heterodimer was prepared from two components in 5% 
overall yield for the 6 steps from the y-methyl ester of l- 

15 glutamic acid or 6% overall yield in 9 steps from the l- 
pipecolinic acid precursor of SLF. 

The synthetic chemistry performed for the preparation of the 
Mtx-SLF is described below. 

20 

Synthetic Chemistry 

General Methods, Reagents were obtained from commercial 
suppliers and were used without further purification. All 
reagents for chemical synthesis were purchased from Aldrich. 

25 Anhydrous iV,iV-dimethylf ormamide and anhydrous methylene chloride 
were from Sure Seal™ bottles purchased from Aldrich. 
Methotrexate was a generous gift from the National Cancer 
Institute. Nuclear magnetic resonance (NMR) spectra were 
recorded on a Bruker 500 (500 MHz) , a Bruker 4O0 (400 MHz) or a 

30 Bruker 300 (300 MHz) Fourier Transform (FT) NMR spectrometer at 




Scheme I 

Synthesis of 2. (31) The y-methyl ester of L-glutamic acid (1) 
(5.02 g, 31.0 mmol) was added to a solution of 70% aqueous (aq.) 
5 perchloric acid (3.0 mL) in tert-butyl acetate (200 mL) . The 
resulting solution was stirred at room temperature (rt) .for 3 h 
during which time the acid dissolves completely. The reaction 
was then judged complete by thin layer chromatography (TLC) 
using 10:1 methylene chloride (CH 2 C1 2 ) -.methanol (MeOH) . The 

10 reaction mixture was extracted with 0.5 N aq. HC1 (4x, 400 mL) . 
The pH of the combined aqueous layers was adjusted to 8 using 
saturated aq. sodium carbonate. The aqueous solution was then 
extracted with ethyl acetate (EtOAc) (4x, 500 mL) . The organic 
extracts were combined, washed with brine (2x, 300 mL) , and 

15 dried over anhydrous sodium sulfate. Removing the solvent in 
vacuo gave 2 as a clear oil in 65% yield: R f = 0.45 in 10:1 
CH 2 Cl 2 :MeOH; *H NMR (400 MHz, CD3OD) 8 3.66 (s, 3), 3-33 (t, J = 
6.5 Hz, 1), 2.41 (m, 2), 1.95 (m, 1), 1.86 (m, 1) ; LRMS, m/z 
218.2 (MH + ) , 219.2 (MH 2 *) . 



20 



Synthesis of 3. (7,30) Conpound 2 (2.19 g, 10 . 0 *nmol) , 1,3- 
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dicyclohexylcarbodiimide (E 3.09 g, 15.0 mmol) , i- 

hydroxylbenzotriazole hydrate (HOBt, 2.43 g, 18.0 mmol) , and N- 
methyl-para-benzoic acid (1.59 g, io.5 mmol) were dissolved in 
anhydrous dimethyl formamide (DMF, 22 mL) under a nitrogen 
5 atmosphere. Diisopropylethylamine (DIEA , 0.1 mL, 0.5 mmol) was 
added to the solution, and the reaction mixture was stirred 
overnight (ON) at rt : After 16 hr, a 1:2:1 water : saturated aq. 
sodium bicarbonate : brine solution (500 mL.) was added to the 
reaction giving a yellow suspension. This solution was then 

10 extracted with EtOAc (4x, 300mL) . The fractions were combined, 
washed with brine (2x, 200mL) and dried over anhydrous sodium 
sulfate. The organic solvent was then removed in vacuo. The 
product is purified by silica gel column chromatography (2:1 to 
1:1 hexanes:EtOAc) in 76% yield: R f = 0.25 in 1:1 hexanes : EtOAc ; 

15 2 H NMR (500 MHz, CD 3 OD) 5 7.66 (d , J = 7.0 Hz, 2), 6.58 <d, J 
=7.0 Hz, 2), 4.59 (dd, J =9.5, 5.0Hz, 1), 3.65 (s, 3), 2.79 
(s, 3), 2.47 (t, J = 7.5 Hz, 2), 2.23 (m, 1), 2.05 <m, 1), 1.46 
( S/ 9); 13 C NMR (300 MHz, CD 3 OD) 8 175.1, 173.0, 170.6, X54 . 6 , 
130.2, 121.5, 111.9, 82.9, 54.3, 52.3, 31.4, 30.0, 28.3, 27.6; 

20 LRMS, m/z 351.2 (MH*) ; HRMS, m/z 351.1930 <MH*) , calculated 
351.1920. 

Synthesis of 4. Compound 3 (500 mg, 1.43 mmol) was dissolved in 
methanol (20 mL) . Lithium hydroxide monohydrate (120 mg, 2.86 

25 mmol) was dissolved in water. Both solutions were chilled in a 
O'C ice bath. The aqueous solution was added to the methanol 
solution all at once. The resulting solution was stirred at 0"C 
for 10 minutes and then allowed to warm to rt and stirred for an 
additional 80 minutes. Solvent is removed in vacuo until only 

30 a yellow gel remained with a volume about 1 mL. Water (20 mL) 
was added to the remaining reaction mixture. The solution was 
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acidified to pH = 2 with 1 N ; 1 (9 mL) and was immediately 

extracted with EtOAc (5x, 25 mL) . The organic extracts were 
combined, washed with brine (2x, 20 mL) , and dried over 
anhydrous sodium sulfate. The solvent was removed in vacuo to 
5 yield product 4 in 93% yield: R f = 0.05 in 1:1 EtOAc : hexanes and 
0.45 in 5:1 CH 2 Cl 2 :MeOH; a H NMR (400 MHz, CD 3 OD) 8 7.64 (d, J = 
7.0 H2, 2) , 6.56 (d, J = 7.0 Hz, 2) , 4.4 6 (dd f J = 9.0/ 5.0 Hz, 
1), 2.80 <s, 3), 2.44 (t, J = 7.5 Hz, 2) , 2.23 (m, 1), 2.05 <m, 
1), 1.47 (s, 9) ; J3 C NMR (300 MHz, CD 3 OD) 8 178.4, 174.8, 172.3, 
10 156.2, 130.7, 123.2, 113.6, 84.6, 57.0, 33.2, 31.8, 30.5, 29.1; 
LRMS, m/z 337.3 (MH*) ; HRMS, m/z 337.1751 (MH + ) , calculated 
337 .1763 . 




15 Synthesis of 7.(27) 



Scheme II 

Synthesized as reported in quant it at i ve 




WO 20114/042345 PCT/IIS200J/0 12<»12' 

-39- 

yield to give 7 as a ye__ solid: R f = 0.40 in 1:1 
EtOAc:hexanes; l H NMR (300 MHz, CDCl 3 ) 5 9.86 (s, 1), 7.61-7.55 
(m, 2) , 7.40-7.34 (m, 2), 7 .32 (d, J = 4.5 Hz, 1), 7.25-7.10 (m, 
2), 6.91-6.85 <m, 2), 3.95 (s, 3), 3.93 <s, 3); LRMS, m/z 284.9 
5 (MH*) . 

Synthesis of 8.(27) Synthesized as reported in near 

quantitative yield based on mass as a white crystalline solid 
(NMR revealed a small amount of non-hydrogenated starting 

10 material remains and is carried through to the next step. NMR 
integration is used to determine relative quantities of the two 
materials.): R f = 0.50 in 1:1 EtOAc : hexanes ; *H NMR (300 MHz , 
CDC1 3 ) ; 5 7.55-7.48 <m, 2), 7.35 (t, J = 8.0 Hz, 1), 7.25 (t, J 
= 8.5 Hz, 1), 7.10 (br d, J = 8.0 Hz, 1), 6.91-6.73 (m, 3), 3.90 

15 (s, 3), 3.88 (s, 3), 3.24 (t, J = 7.5 Hz, 2), 2.99 (t, J = 8.0 
Hz, 2); LRMS , m/z 287.0 (MH*) . 

Synthesis of 9 . (27) Synthesized as reported to give 9 as a 
clear oil in quantitative yield by mass: R f = 0.60 in 1:1 
20 EtOAc: hexanes; *H NMR (300 MHz, CDCI3) 5 7.59 (br d, 8.0 Hz, 1) , 
7.49 (m, 1), 7.37 (t, J = 8.0 Hz, 1), 7.14 (dd, J = 8.0, 2.5 Hz, 
1), 6.85-6.72 (m, 3), 4.60 (s, 2), 3.89 (s, 3), 3.86 (s, 3), 
3.27 (t, J * 7.1 Hz, 2), 3.04 (t, J = 7.5 Hz, 2), 1.50 (s, 9). 

25 Synthesis of 10.(27) Synthesized as reported to yield product 
10 in 20% yield fully purified. However, a 75% yield of 
unreacted starting material was also recovered. R £ = 0.40 in 5:1 
CH 2 Cl 2 :MeOH and 0.40 in 1:1 EtOAc : hexanes ; *H NMR (500 MHz, 
CD3OD) 8 7.28 (t, J = 8.0 Hz, 1), 6.93 (d, J = 7.5 Hz, 1), 6. €9 

30 (s, 1), 6.82 (d, J = 8.0 Hz, 1), 6.80-6.74 (m, 2), 6.70 (d, J = 
8.0 Hz, 1)/ 4.60-4.53 (m, 3), 3. SO (s, 3), 3.78 (s, 3), 2.65- 
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Scheme XXX 

5 Synthesis of 12.(28) Synthesized as reported to give 12 as a 
white crystalline solid in quantitative yield based on NMR 
integration: A H NMR (4 00 MHz, CD 3 OD) 5 4.03 (br d, J = 9.5 Hz, 
1), 3.83 (s, 3), 3.41 (br d, J m 12.0 Hz, 1), 3.04 <br t, J = 
11.0 Hz, 1), 2.27 (br d, J = 11.5 Hz, 1), 2.00-1.63 (m, 5). 

0 

Synthesis of 13. (28) Synthesized as reported to yield 13 as a 
clear oil in 86% yield: R f = 0.70 in 1:1 EtOAc rhexanes and 0.25 
in 1:4 EtOAc:hexanes; 2 H NMR (5€0 MHz, CD 3 OD) 8 5.15 (br d, J = 
5.0 Hz, 0.7), 4.62 (br d, J = 4.0 Hz, 0.3), 4.34 (br <3, J = 
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12.0 Hz, 0.3), 3.89 (s, 2.1). (s , 0 . 9) , 3 . 77 (s, 3) , 3.57 

(br d, J = 14.0 Hz, 0.7), 3.35 (m, 0.7), 2.91 (br t, J * 13.0 
Hz, 0.3), 2.35-2.24 (m, 1), 1.83-1.65 (m, 3) 1.55-1.35 (m, 2) 
This product exists as a 2.5:1 mixture of the trans and cis 
5 conformations. Further analysis by COSY allows us to make the 
following assignments of the two peaks for each proton in the 
structure: 5.15 and 4.62, 4.34 and 3.57, 3.89 and 3.84, 3.35 and 
2.91; LRMS, m/z 230.1 (MH,*) . 

Synthesis of 14.(28) Synthesized as reported to give 14 as a 
clear oil in 81% yield: R f = 0.85 in 1:1 EtOAc :hexanes and 0.50 
in 1:4 EtOAc:hexanes; l H NMR (500 MHz, CD 3 OD) 8 5.16 (br d, J = 
5.5 Hz, 0.8), 4.38 (br d, J = 11.5 Hz, 0.2), 4.24 (br d, J = 5.0 
Hz, 0.2), 3.75 (s, 3), 3.39 (br d, J = 13.0 Hz, 0.8), 3.23 (td, 
J =13.0, 2.8 Hz, 0.8), 2.91 (br t, J = 13.0 Hz, 0.2), 2.30 (br 
d, J • 14.0 Hz, 0.8), 2.22 (br d, J = 13.5, 0.2), 1.79-1.60 (m, 
5) 1.55-1.35 <m, 2), 1.23-1.13 (m, 6), 0.86 (t, J = 7.5 Hz, 3) 
This product exists as a 4:1 mixture of the trans and. c ± s 
conformations. Further analysis by COSY allows us to make the 
following assignments of the two peaks for each proton in the 
structure: 5.16 and 4.24, 4.38 and 3.39, 3.23 and 2.91, 2.30 and 
2.22; LRMS, m/z 270.2 (MH*) . 

Synthesis of 15. (28) Synthesized as reported to give the 
25 product 15 as a white crystalline material in 96% yield: R t = 
0.05 in 1:1 EtOAc : hexanes ; l H NMR (500 MHz, CDC1 3 ) 6 5.29 (br d, 
J « 5.5 Hz, 0.8), 4.50 (br d, J = 14.0 Hz, 0.2), 4.26 (br d, J 
= 4.5 Hz, 0.2) , 3.43 (br d, J = 13.0 Hz, 0.8) , 3.24 (td, J = 
12.5, 3.5 Hz, 0.8), 2.94 (br td, J = 13.0, 3 Hz, 0.2), 2.34 (br 
30 d, J = 12.0 Hz, 0.8), 2.24 (br d, J = 13.5, 0.2), 1.79-1.60 (m, 
5) 1.55-1.35 (m, 2), 1.23-1.13 (m, 6), 0.86 (t, J= 7.5 Hz, 3) 
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This product exists as a 4: .ture of the trans and c is 

conformations. Further analysis by COSY allows us to make the 
following assignments of the two peaks for each proton in the 
structure: 5.29 and 4.24, 4.50 and 3.43, 3.24 and 2.94, 2.34 and 
2.24. Compounds 16, 17, 18, and Mtx-SLF all have this same 4:1 
conformation pattern and appear nearly the same 
spectroscopically as compound 15 for these peaks. For 
simplicity's sake the 0.8 integration will be called 1 and the 
0.2 peak will be disregarded in the characterization for the 
rest of these compounds . 
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Scheme IV 

Synthesis of 16.(27) Synthesized as reported to give 16 as a 
colorless oil in 88% yield: R, « 0.15 in 4 :1 hexanes :EtOAc and 
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.0.65 in 2:1 EtOAc : hexanes ; *] (500 MHz, CD 3 OD) 6 7.28 (t, J 

= 8.0 Hz, 1), 6.96 (m, 1), 6.89 (s, 1), 6.85 (m, 2) , 6.78 (s, 
1), 6.71 (d, J = 8.0 Hz, 1), 5.73 (m, 1), 5.21 (br d, J = 5.5 
Hz, 1), 4.58 (s, 2),. 3.80.(8, 3), 3.78 (s, 3), 3.38 (br d, J = 
5 14.5 Hz, 1), 3.17 (td, J = 13.0, 3.0 Hz, 1), 2.65-2.52 <m, 2), 
2.32 (br d, J = 13.0 Hz, 1), 2.30-2.20 (m, 1), 2.05 (p, J = 7.0 
Hz, 1), 1.78-1.58 (m, 5), 1.46 (s, 9), 1.41-1.26 (m, 2), 1.23 
(s, 3), 1.21 (s. 3), 0.86 (t, J • 7.5 Hz, 3) (see note in 
Compound 15); LRMS , m/z 64 0.7 (MH-) . 

10 

Synthesis of 17.(27) Synthesized as reported to give 17 in 
quantitative yield. TLC analysis showed only one product and 
the acid was used without further purification: R t = 0.05 in 1:1 
EtOAc: hexanes and 0.3 5 in 10:1 CH,C1 2 : MeOH . 

15 

O 




Scheme V 

Synthesis of 18. Compound 4 (42.7 mg, 0.127 mmol) , compound 17 
(74.1 mg, 0.127 mmol), 1, 10-diaminodecane (20.7 mg, 0.120 mmol). 
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DCC (130 mg, 0.631 mmol), an ? (30.0 mg, 0.246 ramol) were 

dissolved in CH,C1, (3 ml.) under a nitrogen atmosphere and 
stirred at rt overnight. After 16 hours, the solvent was removed 
in vacuo. The product was purified by silica gel column 
chromatography (1:4 EtOAc : hexanes to pure EtOAc) to give a white 
solid in 30% yield: R f = 0.65 in EtOAc and 0.60 in 10:1 
CH 2 Cl 2 :MeOH; "H NMR (500 MHz , CD 3 0D) 8 7.64 (d, J = 7.0 Hz, 2), 
7.28 (td, J = 8.0, 3.0 Hz, 1) , 6.98 (m, 2) , 6.91 (d, J = 7.5 Hz, 
1), 6.85 (d, J = 8.0 Hz, 1), 6.76 (s, l) , 6.69 (d, J = 8.0 Hz, 

1) , 6.56 (d, J = 7.0 Hz. 2), 5.74 (m, 1), 5.21 (br d, J - 5.0 
Hz, 1), 4.51 (s, 2), 4.46 (m, 1), 3.80 (s, 3), 3.78 (s, 3), 3.38 

(br d, J = 12.5 Hz, 1), 3.23 (t, J = 7.0 Hz, 2) 3.17 (td, <J = 
13.0, 3-0 Hz, 1), 3.11 (t, J = 7.0 Hz, 2), 2.80 (s, 3), 2.65- 
2.52 (m, 2), 2.37-2.30 (m, 3), 2.30-2.15 (m, 2), 2.10-2.00 (m, 

2) , 1.78-1.58 (m, 4), 1.48 (s, 9), 1.54-1.38 (m, 5), 1.38-1.28 
(m, 2), 1.28-1.16 (m, 18) , 0.86 (t, J = 7.5 Hz, 3) (see note in 
Compound 15) . 

Synthesis of Mtx-SLF. (7,26,30) Compound 18 (40.0 mg, 38.0 ^mol) 
and the hydrobromide salt of . 2 , 4-diamino-6-bromomethyl pteridine 
(18 mg, 45 jxmol) were dissolved in N , N > -dimethyl acet amide (2 
mli) . The reaction mixture was stirred in a 5 0°C oil bath for 12 
hours. The intermediate product (R f = 0.50 in 10:1 CH 2 Cl 2 :MeOH) 
was purified by silica gel column chromatography (30:1 to 10:1 
CHjCl 2 :MeOH) . The crude product was dissolved in trif luoroacetic 
acid (3 mli) at 0°C for 5 minutes and allowed to warm to rt and 
stirred at rt for 1 hour. Toluene (3x, 50 mL) was added to the 
reaction mixture, and all solvent was removed in vacuo. After 
removal of solvents, Mtx-SLF was purified by preparative thin 
layer silica gel chromatography (5:1 CH 2 Cl 2 :MeOH, 4x) to give a 
yellow solid in 33% yield (for two steps): R f = 0.15 in 3:1 
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CH 2 Cl 2 :MeOH; >H NMR (500 MHz, 5 8.55 (s, 1), 7.73 (d, J = 

8.0 Hz, 2) , 7.28 (t, J = 7.5 Hz, 1) , 6.98 (tn, 2) , 6.91 (br d, J 
= 7.5 Hz, 1), 6.85-6.79 (tn, 3), 6.76 (s, 1), 6.69 (d, J = 6.5 
Hz, 1), 5.73 (m, 1), 5.21 (br s, 1), 4.75 (s, 2), 4.50 (s, 2), 
5 4.46 (m, 1) , 3.80 (s, 3) , 3.78 <s, 3) , 3.38 (br d, J 13.0 Hz, 
1), 3.25-3.10 (m, 6), 3.05 (t/ J - 7.0 Hz, 2), 2.65-2.52 (m, 2), 
2.37-2.15 (m, 5), 2.10-2.00 <m, 2), 1.78-1.58 (m, 4), 1.50-1.42 
(m, 3), 1.38-1.32 (m, 3), 1.35-1.25 (m, 4 ) , 1 . 25-1 . 16 (m, 15), 
0.86 (t, J = 7.5 Hz, 3) (see note in Compound 15). 

10 

Small molecule concentration calibration. The small molecules 
were dissolved in DMF to concentrations of 10 mM for Mtx and 12 
mM for the Mtx-SLF molecule. The concentrations of Mtx and Mtx- 
SLF were determined by Beer's law using an extinction 
15 coefficient of e = 6700 cm^M' 1 (calculated from a known solution 
of Mtx in DMF) for Mtx-SLF. Solutions of compound 16 (SLF-OtBu) 
were prepared on a sufficient scale to mass 16 accurately. All 
small molecules were stored under a nitrogen atmosphere at -80°C 
and allowed to come to rt before use. 

20 

Part II - Construction fo E-coll strain. 

The next step was the construction of the E. coli strain 
expressing the kd-FKBP12 and aNTD-DHFR fusion proteins and 
containing the lacZ reporter construct . Plasmids encoding the 

25 A.cl -FRBP12 and ctNTD-DHFR chimeras were prepared from vectors 
pACXcI32 and pBRaliN using standard molecular biology techniques 
(8) . The same synthetic lac£ reporter, placOR2-62, as initially 
reported by the Hochschild lab was used. The reporter placOR2- 
62 is maintained in one copy in the chromosome as a prophage and 

30 encodes the lacZ gene 62 bp downstream from the Xcl operator 
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(Figure 1) (16) . Based on pre. _ results from Kopytek and Hu 

showing that tolC~ and thyA" mutations improved the viability an d 
tolerance of E.coli to Mtx-based molecules, we expected export 
as well as toxicity of Mtx-SLF to be problematic in E. call 
5 (32) . Thus, we modified the original Hochschild strain KS1 to 
be tolC in order to decrease active export of our small 
molecule. At the low concentrations of Mtx-SLF required for the 
three-hybrid experiments, however, Mtx was not sufficiently 
toxic to warrant the thyA~ mutation. We introduced the tolC' 
10 mutation into KS1 via a Plvir transduction from strain SK037 
(32) . We call this test strain V674E. Transformation of the 
plasmids bearing the various ^cl and a NTD fusion proteins into 
V674E yielded the final experimental strains. 

15 The molecular biology performed for the preparation of the 
strain is described below. 

General methods . Restriction enzymes, Vent DNA polymerase and T 4 
DNA ligase were purchased from New England Biolabs . The dNTPs 

20 

used in the Polymerase Chain Reation (PCR) were purchased from 
Pharmacia Biotech. Oligonucleotides were purchased from The 
Great American Gene Company ( www . geneco .com) m The bacto-agar, 
tryptone-peptone and bacto-yeast extract were purchased from 
DIFCO. Falcon 14 ml> culture tubes were used for growing 

25 bacteria. Corning Costar 96-well plates with V-shaped wells 
used for growing the bacteria for solid media plate assays . Th e 
phrog used to transfer cells into 96-well plates or onto petri 
plates containing agar media was purchased from Dan-Kar Corp. 
(Wilmington, MA) . 5-Bromo-4-chloro-3-indolyl- P-D- 

30 galactopyranoside (X-gal) for the plate assays was purchased 
from Diagnostic Chemicals (Oxford, CT) . o-Nitrophenyl-P~D- 
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15 



galactopyranoside (ONPG) for liquid assays was purchased 

from Sigma. Methotrexate was from the National Cancer 

Institutes (NCI). All other chemicals were purchased from 
Aldrich or Sigma. All aqueous solutions were made with 
distilled water prepared from a Milli-Q Water Purification 
System. For Polymerase Chain Reaction (PCR), a MJ Research PTC - 
200 Pellier Thermal Cycler was employed. The transformation of 
E. coli was carried out by electroporation using a Bio-Rad E. 
coli Pulser. UV/Vis spectra and absorbances were taken using a 
Perkin-Elmer Lambda 6 Spectraphotometer . Restriction digests 
were carried out as recommended by New England Biolabs. 
Sequencing of the receptor domains of all plasmids constructed 
was performed by Gene Wiz (New York, NY). All other standard 
molecular biology techniques were carried out essentially as 
described (33-35) 



20 



The bacterial two-hybrid plasmids, P AC^cl32, pBR a IiN, pACLGF2 , 
and pBR a Galll P , and £. coli strain KS1 were obtained (8,21). 
PI vir and E. coli strain SK037 were obtained (32); and plasmid 
pJG-FKBP12 was obtained (2). Map of pBR&LN with TEM1 follows. 
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Construction of XcI-FKBP12 anu J-DHFR protein chimeras. The 

gene encoding dihydrof olate reductase (DHFR) from E. coli was 
subcloned from plasmid pMW102eDHFR to pBRaLN (7,26). A 531 bp 
Jfotl to BamHI fragment was prepared by polymerase chain reaction 
(PCR) from pMW102eDHFR (26) using the primers 5 1 -ACT TCA GGT GCG 
GCC GC A GGC TCG GGC GGC TCG GGC GGC GGA GTG CAG GTG GAA AC- 3' 
(5' primer VWC482, flanked by a NotI site (underlined) and a 
Gly-Ser-Gly-Gly-Ser-Gly-Gly linker) and 5'-TGT ATC AAC S£A_TCC 
TTA ATG GTG ATG GTG ATG GTG CGA GCC GAA TTC TTC CAG TTT TAG AA- 
3» (3 ' primer VWC487, flanked by a BamHI site (underlined) and 
a 6 His tag followed by a stop codon) . This fragment was 
inserted between the J^otl site and the BamHI site in pBRotLiN to 
give to pBRotLNeDHFR . The region generated by PCR was verified 
by DNA sequencing using primers VWC62 8, 5'-CTG GCT GAA CAA CTG 
GAA GC, and VWC629, 5' -ATA TAG GCG CCA GCA ACC GC. 



The gene encoding FKBP12 was subcloned from plasmid pJG-FKBP12 
to pACXcI32. A 3 84 bp Wot I to Asc I fragment was prepared by 
polymerase chain reaction (PCR) from plasmid pJG-FKBP12 using 
20 the primers 5 ' -ACT TCA GGT ^Pfi GCC GCA GGC TCG <5GC GGC TCG GGC 
GGC GGA GTG CAG GTG GAA AC- 3 ' (5' primer VWC611, flanked by a 
NotZ site (underlined) and a Gly-Ser-Gly-Gly-Ser-Gly-Gly linker) 
and 5'- TGT ATC AAC GGC GCG CCT TAA TGG TGA TGG TGA TGG TGC GAG 
CCG AAT TCT TCC AGT TTT AGA A- 3 ' (3' primer VWC612, flanked by 
25 a AscI site (underlined) and a 6 His tag followed by a stop 
codon) . This fragment was inserted between the Notl site and 
the AscI site in plasmid pACXcI32 to give pACkcIFKBP12 . The 
region amplified by PCR was verified by DNA sequencing using 
primers VWC626, 5'-CCC AAT GAT CCC ATG CAA TG, and VWC627 , S'- 
30 GCG CTT CGT TAA TAC AGA TG. 
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Construction of bacterial stru-.*. 



»-\35 f 36) Strain V674E was made 



from KS1 via a Plvir transduction from SK037 essentially as 



infection and 60 minutes after addition of citrate in order to 
accommodate slower growing times for the mutated strain. The 
infected cells are then plated on LB plates containing 12 Hg/mL 
tetracycline and 20 mM sodium citrate and incubated at 37*C 
overnight. In addition, the positive transductants are verifie d 
by streak purification on an LB plate with 12 M-g/mL 
tetracycline, 100 ^g/mL kanamycin, and 20 mM sodium citrate to 
give strain V674E. 

Electrocompetent cells of strain V674E were made by standard 
methods (35); however, the transformation efficiency of this 
strain is low (lO 6 -!©" 7 ) due to the tolC mutation. Using standard 
methods, both the pAC^cI32 and the pBR a LN plasmids or 
appropriate derivatives were transformed into the strain. 
Transf ormants were grown on LB with 100 Mg/mL ampicillin and 6 
M^g/mL chloramphenicol. Note: Chloramphenicol is also a 

substrate for tolC so levels above 8 M-g/mL significantly impair 
growth and viability of tolC~ strains. In addition MacConkey 
plates can't be used because MacConkey media is toxic to strains 
bearing a tolC~ mutation. * 

LacZ assays. (7, 20, 35) The bacterial strains were stored as 20% 

© 

glycerol stocks in 96-well plates at -80 C. To do the plate 
assay, the bacterial strains were first phroged to LB liquid 
media containing 100 ^g/mL ampicillin and 6 Hg/mL chloramphenicol 
in a 96-well plate and then incubated in a 37 °C shaker overnight . 
These saturated cultures were used to innoculate a second 96- 



described - 



Incubation times were increased to 30 minutes for 
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well plate containing fresh iquid media with 100 ng/mL 

ampicillin, 6 ng/mL chloramphenicol, and 0.5 mM IPTG using the 
phrog. These cultures were grown at 37°C shaking at 240 rpm for 
4-5 hours until the cultures were just beginning to be turbid. 

5 These cultures were then phrogged to X-gal plates and X-gal 
plates with varying concentrations of Mtx-SLF and incubated at 
37°C. The X-gal plates are LB solid media plates containing 40 
Hg/mL X-gal, 0.5 mM IPTG, 0.5 tPEG (a |3-galactosidase inhibitor), 
100 |ig/ml» ampicillin and 6 |ig/mL chloramphenicol. X-gal 

10 hydrolysis was first visible at about 8 hours and increased 
until about 18 hours of growth at which point it appeared to 
reach a maximum. 

For the liquid assays , the strains were incubated overnight in 
15 LB liquid media containing 100 ^ig/mL ampicillin, 6 *ig/mL 
chloramphenicol, and 0 . 5 mM IPTG overnight. These cultures were 
used to innoculate 1000 fold into LB liquid media containing 
lOOfig/mL ampicillin, 6*ig/mL chloramphenicol, 0.5 mM IPTG, and 
small molecule (per concentration indicated in each experiment) 
20 and grown to a final OD 600 of 0.4-0.6 in a 37°C shaker which took 
about 5 hours. No tPEG is used in the liquid assays in order to 
get a true quantization of the transcription. Then, the 
protocol from Miller was followed exactly using 0.1 mL <^ell 
lysate for the assay (35) . 

25 

Part: III - Screening 

Using standard J3-galactosidase assays on plates (35) , we 
established that Mtx-SLF activates transcription of the la.cZ 
reporter gene in JET. coli strain V674E. For the plate assays, 
30 mid-log phase liquid cultures of the strains were transferred to 
a luria broth (LB) plate containing Mtx-SLF, 5-bromo-4-chloro-3- 



1 
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ndolyl-P-D-galactoside iA-gal) , isopropyl - P- D - 

thiogalactopyranoside (IPTG) , phenylethyl-P-D-thiogalactoside 
(tPEG, a P-galactosidase inhibitor) , and the appropriate 
antibiotics. The plates were then incubated for 18 hours at 
37 C. The concentrations of . Mtx-SLF were varied between 0.1 Hm 
and 10 (not all shown) . On plates containing Mtx-SLF, we 

observed a substantial increase in the levels of P-galactosidase 
synthesis over those due to basal transcription (Figure 3) . 
Importantly, this increase is d ependent on both DHFR and FKBP12 
and correlates with the concentration of Mtx-SLF in the plate 
media. Also, it is interesting to note that at 10 pM Mtx-SLF the 
levels of small-molecule induced transcription in the three- 
hybrid strain appears to be greater than those induced by the 
direct protein-protein interaction in the two-hybrid control. 

These X-gal plate results were confirmed using more quantitativ e 
liquid P-galactosidase assays. The liquid assays were carried 
out essentially as originally described by Miller (35) . 
Overnight cultures were used to innoculate fresh LB media 
containing Mtx-SLF, IPTG, and the appropriate antibiotics. 
These cultures were grown, lysed, and assayed for P-galactosidase 
activity using o-nitrophenyl-P-D-galactopyranoside (ONPG) , a 
chromogenic substrate for P-galactosidase. We once again 
observed small-molecule dependent transcriptional activation 
(Figure 4). Cells expressing ^-cI-FKBP12 and a NTD-DHFR showed 6- 
fold activation at 1 ^ and 10-fold activation at 10 M** Mtx-SLF 
relative to cells expressing only ^cl and a NTD. For comparison , 
the levels of transcription in cells expressing ^d-Gal4 and 
a NTD-Galll p are 13-fold those of cells with ^cl and a NTD and both 
are unaffected by the concentration of Mtx-SLF in the media. As 
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seen in Figure 5, the levels u^anscriptional activation in 

the three-hybrid system correlate with the concentration of Mtx - 
SLF in the media. We begin to see transcriptional activation a t 
0.3 Mm Mtx-SLF, and the levels of activation are still increasing 
5 at 10 Mm Mtx-SLF. At higher concentrations, Mtx-SLF begins to 
be toxic to the JET. coll cells. Interestingly, at 10 Mm Mtx-SLF, 
the levels of transcriptional activation in the three-hybrid 
strain approach those in the two-hybrid strain. We attribute 
this sensitivity to the picomolar affinity of the Mtx/DHFR 

10 interaction, although this interpretation has not been proven. 
Again several independent controls establish that 
transcriptional activation indeed requires both halves of Mtx- 
SLF (Figure 4). Neither Mtx, SLF, nor a combination of the two 
affects the levels of transcription in the three-hybrid system. 

15 At 1 Mm Mtx-SLF, a 10-fold excess of either Mtx or the tejrt-butyl 
ester of SLF decreased the levels of transcription to about hal f 
that with 1 Mm Mtx-SLF alone. Deletion of either DHFR or FKBP1 2 
from the ^cI-FKBP12 and a NTD-DHFR fusion proteins drops the 
levels of small-molecule induced transcriptional activation to 

20 the background levels observed with only ^cl and a NTD. 

DISCUSSION 

The bacterial small-molecule th ree-hybrid system described here 
provides a robust platform for high-throughput assays based on 

25 protein-small-molecule interact ions . The Mtx-SLF heterodimeric 
ligand can be prepared readily and gives a strong transcription 
readout in the E. coll RNA polymerase three-hybrid system. 
Notably, the levels of transcri ptional activation with the Mtx- 
SLF three-hybrid system are comparable to those with the Gal4- 

30 Galll p interaction, despite the fact that one nbn-oovalent 
interaction has been replaced with two. This result may speak 



WO 2OIM/04234S 



I»CT/1IS2<NI3/012<»I2 



-53- 

to the importance of the part_ ^~rly high affinity between Mtx 
and DHFR. 

By adapting bacteria to allow for the use of the Mtx/ DHFR 
5 interaction, the described three-hybrid system that uses 
Mtx/DHFR is far superior to. previous bacterial systems of any 
type. For example, while it is a different system and designed 
for different purpose, the system described in (9) provides a 
context in which the described three-hybrid system based on 

10 Mtx/DHFR can be analyzed. The concentrations of small molecule 
in (9) vary between 50 and 350 uM with 250uM being the first 
concentrations at which activity is conclusive. The described 
Mtx/DHFR system does not need higher than lOuM and results can 
be detected absolutely at concentrations below luM. In 

15 addition, the described Mtx/DHFR system shows a 10— fold increas e 
in signal upon addition of smal 1 molecules whereas in (9) shows 
a signal to noise ratio of at most 1.25. In fact, the writers 
of (9) admitted that screening a library would be very difficult 
with their system, as shown by the fact that when screening a 

20 library for activity, they obtain 6 false positives and only a 
single true positive hit. In a ddition, the system described in 
(9) requires several different fluorescent dyes in order to 
maintain signal stability. Thus, the described bacterial 

three-hybrid system using Mtx/DHFR is far superior for screening 

25 libraries because of the far lower concentrations of small 
molecule requirements due to much stronger interactions, as wel 1 
as a vastly improved signal to noise ratio. 

Three-hybrid systems provide an In vivo alternative to affinity 
30 chromatography that can be used to evolve proteins that 
recognize a particular small molecule, to screen a library of 
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small molecules based on bindi w a particular protein, or to 
screen cDNA libraries to find the protein targets of drugs or t o 
classify proteins based on their small-molecule interactions. 
Because of the high transformation efficiency and rapid doublin g 

5 time of E- coll, this system should increase the number of 
proteins that can be tested in three-hybrid assays by several 
orders of magnitude compared with yeast systems. A bacterial 
assay should be particularly advantageous in molecular evolution 
experiments where on the order of 10 8 variants may be necessary 

10 to alter protein function. Based on our results, we believe Mt x 
will provide a versatile anchor for presenting a variety of 
different small molecules. 

The yeast three-hybrid assay provided a method for In vivo 
15 affinity chromatography that greatly simplifies protein 
identification and amplification at the end of affinity panning . 
The bacterial system described here should increase the number 
of protein variants that can be assayed by several orders of 
magnitude. 

20 
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What is claimed is: 

1. A transgenic bacterial cell comprising 

(a) a dimeric small molecule which comprises a first moiet y 
known to bind a first receptor domain covalently linked to 
a second moiety capable of binding a second receptor 
domain, wherein the first and second moieties are 
different; 

(b) nucleotide sequences which upon transcription encode 

i) a first fusion protein comprising the first 
receptor domain, and 

ii) a second fusion protein comprising the second 
receptor domain; and 

(c) a reporter gene wherein expression of the reporter 
gene is conditioned on the proximity of the first fusion 
protein to the second fusion protein. 

2. The bacterial cell of claim 1, wherein the dimeric small 
molecule has the formula: 

HI - Y - H2 

wherein each of HI and H2 may b e the same or different and 
capable of binding to a receptor which is the same or 
different with a IC 5 o of less than 100 MM; and 

wherein Y is a linker which may be present or absent - 

3. The bacterial cell of claim 2, wherein HI or H2 is capable 
of binding to a receptor with a IC 5 o of less than 10 MM. 



The bacterial cell of claim 2, wherein HI or H2 is capable 
of binding to a receptor with a IC 5 o of less than 1 MM. 
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12. The bacterial cell of c wherein the dimeric small 

molecule has the structure: 



x-X 

HjN N N 




OMc 



OMe 




wherein n is an integer from 1 to 20. 

13. The bacterial cell of claim 12, wherein n is an integer 
from 2 to 12 . 

14. The bacterial cell of claim 12, wherein n is an integer 
from 3 to 9- 

15. The bacterial cell of claim 12, wherein n is 8 . 

16. The bacterial cell of claim 1, wherein the first fusion 
protein further comprises a DNA binding domain, and the 
second fusion protein further comprises a transcription 
activation domain. 



17. The bacterial cell of claim 1, wherein the first fusion 
protein further comprises a transcription activation 
domain, and the second fusion protein further comprises a 
DNA binding domain. 



18. The bacterial <rell of claim 16 or 17, wherein the 
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5. The bacterial cell of claim 2, wherein HI or H2 is capable 
of binding to a receptor with a IC so of less than 100 nM. 

6. The bacterial cell of claim 2, wherein HI or H2 is capable 
of binding to a receptor with a IC so of less than 10 nM. 

7. The bacterial cell of claim 2, wherein HI or H2 is capable 
of binding to a receptor with a IC 50 of less than 1 nM. 

8. The bacterial cell of claim 2, wherein HI or H2 ±s a 
methotrexate moiety, FK506 moiety, an FK506 analog, a 
tetracycline moiety, or a cephem moiety. 

9. The bacterial cell of claim 8, wherein HI or H2 is a 
methotrexate moiety. 

10. The bacterial cell of claim 8, wherein HI or H2 is an FK50 6 
analog. 

11. The bacterial cell of claim 10, wherein the FK506 analog 
has the structure: 



OMe 
OMe 
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transcription activation .n is a NTD. 



19. The bacterial cell of claim 16 or 17, wherein the DNA- 
binding domain is ^-cl, AraC, LexA r Gal4, or zinc fingers. 

20. The bacterial cell of claim 1, wherein the first or the 
second receptor domain is that of dihydrof olate reductase 
("DHFR"), glucocorticoid receptor, FKBP12, FKBP mutants, 
tetracycline repressor, or a penicillin binding protein. 

21. The bacterial cell of claim 20, wherein the DHFR is the 
E.coll DHFR ("eDHFR") . 

22. The bacterial cell of claim 1, wherein the first fusion 
protein is DHFR-^cI or FKBP12-^cl . 

23. The bacterial cell of claim 1, wherein the second fusion 
protein is DHFR-<*NTD or FKBP12- a NTD. 

24. The bacterial cell of claim 1, wherein the reporter gene i s 
Lac Z, araBAD, aadA (spectinomycin resistance) , his3, P- 
lactamase, GFP, luciferase, Tet R (tetracyclin resistance) , 
KanR (kanamycin resistance), Cm (chloramphenicol 
resistance).. 



25. The bacterial cell of claim 
is Lac Z. 

26. A transgenic bacterial cell 
(a) a dimeric small molecule 

moiety covalently linked to 



24, wherein the reporter gene 
comprising 

which comprises a methotrexat e 
a moiety capable of binding a 
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receptor domain; 



(b) nucleotide sequences which upon transcription encode 



i) a first fusion protein comprising a DHFR domain, 



and 



ii) a second fusion protein comprising the receptor 



domain; and 



(c) a reporter gene wherein expression of the reporter 
gene is conditioned on the proximity of the first fusion 
protein to the second fusion protein. 



. 27 . 



The bacterial cell of claim 26 wherein the moiety 



known to 



bind a receptor domain is capab le of binding to a receptor 
with a IC 5 o of less than 100 nM. 

28. The bacterial cell of claim 26/ wherein the moiety known t o 
bind a receptor domain is capab le of binding to a receptor 
with a IC50 of less than 10 nM. 

29. The bacterial cell of claim 26, wherein the moiety known t o 
bind a receptor domain is capab le of binding to a receptor 
with a IC50 of less than 1 nM. 

30. A method for identifying a molecule that binds a known 
target receptor in a bacterial cell from a pool of 
candidate molecules, comprising: 

(a) forming a dimeric molecule by covalently bonding each 
molecule in the pool of candidate molecules to a ligand 
capable of selectively binding to a receptor; 

(b) introducing the dimeric mol ecule into a bacterial cell 
culture comprising bacterial cells that express a first 
fusion . protein which comprises the known target receptor 
domain against which the candidate molecule is screened, 
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a second fusion protein v comprises the receptor domai n 

to which the ligand selectively binds, and a reporter gene 
wherein expression of the reporter gene is conditioned on 
the proximity of the first fusion protein to the second 
fusion protein; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein and to the second fusion protein, bringing 
the two fusion proteins into proximity so as to activate 
the expression of the reporter gene; 

(d) selecting the bacterial cell that ; expresses the 
reporter gene; and * 

(e) identifying the small molecule that binds the known 
target receptor. 

31. The method of claim 30, wherein the steps (b)-(e) of the 
method are iteratively repeated in the presence of a 
preparation of random small molecules for competitive 
binding with the screening molecule so as to identify a 
molecule capable of competitive ly binding the known target 
receptor . 

32. The method of claim 30, wherein the dimeric molecule is 
obtained from a combinatorial library. 

33. The method of claim 30, wherein the dimeric molecule 
comprises a ligand capable of selectively binding to a 
receptor with a IC 5 o of less than 100 I^M . 

34. The method of claim 33, wherein the dimeric molecule 
comprises a ligand capable of selectively binding to a 
receptor with a IC 5 o of less than 100 nM. 
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35. The method of claim 3 erein the dimeric molecule 
comprises a ligand capable of selectively binding to a 
receptor with a IC 50 of less than 1 nM. 

36. The method of claim 30, wherein the dimeric molecule 
comprises a methotrexate moiety, FK506 moiety, an FK506 
analog, a tetracycline moiety, or a cephem moiety. 

37. The method of claim 30, wherein the dimeric molecule 
comprises a methotrexate moiety. 

38. The method of claim 30, wherein the dimeric molecule 
comprises an FK506 analog . ■ 

39. The method of claim 38, wherein the FK506 analog has the 
structure : 

.OMe 
OMe 




40. The method of claim 30, wherein the first fusion protein 
further comprises a DNA binding domain, and the second 
fusion protein further comprises a transcription activatio n 
domain . 



41. 



The method of claim 30, wherein the first fusion protein 
further comprises a transcription activation domain, and 
the second fusion protein further comprises a DNA binding 
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domain. 

42. The method of claim 40 or 41, wherein the transcription 
activation domain is a NTD. 

43. The method of claim 4 0 or 41, wherein the DNA-binding 
domain is ^cl, AraC, LexA, Gal4, or zinc fingers. 



44. The method of claim 30, wherein the first or the second 
fusion protein comprises a receptor domain of dihydrofolat e 
reductase { ^DHFR" ) , glucocorticoid receptor, FKBP12, FKBP 
mutants, tetracycline repressor, or a penicillin binding 
protein 

45. The method of claim 44, wherein the DHFR is the E.coll DHFR 
("eDHFR") . 

46. The method of claim 30, wherein the first fusion protein i s 
DHFR-^cI or FKBP12-^cI. 

47. The method of claim 30, wherein the second fusion protein 
is DHFR-<*NTD or FKBP12- a NTD. 

48. The method of claim 30, wherein the reporter gene is lac Z 9 
araBAD, aadA, his3 f P-lactamase, GFP, lucif erase, TetR, 
KanR, Cm. 



49. The method of claim 4 8, wherein the reporter gene is I»ac Z. 

50. A method for identifying an unknown target receptor to 
which a known molecule is capable of binding in a bacteria 1 
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cell, comprising: 

(a) providing a dimeric molecule having a first ligand 
which has a specificity for the unknown target receptor 
covalently bonded to a second ligand capable of selectivel y 
binding to a receptor; 

(b) introducing the dimeric mol ecule into a bacterial cell 
which expresses a first fusion protein which comprises the 
unknown target receptor domain, a second fusion protein 
which comprises the receptor domain to which the second 
ligand selectively binds, and a reporter gene wherein 
expression of the reporter gene is conditioned on the 
proximity of the first fusion p rotein to the second fusion 
protein; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein and to the second fusion protein so as to 
activate the expression of the reporter gene; 

(d) selecting which bacterial cell expresses the unknown 
target receptor; and 

(e) identifying the unknown target receptor. 

51. The method of claim 50, wherein the unknown target receptor 
is encoded by a DNA from the group consisting of 
genomicDNA, cDNA and syntheticDNA . 

52. The method of claim 50, wherein the steps (b)-(e) of the 
method are iteratively repeated so as to identify the 
unkown target receptor. 

53. The method of claim 50, wherein the dimeric molecule 
comprises a ligand capable of selectively binding to a 
receptor with a IC 5 o of less than 100 Wtf. 
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54. The method of claim j5 . , ....erein the dimeric molecule 
comprises a ligand capable of selectively binding to a 
receptor with a IC 50 of less than 100 nM. 

55. The method of claim 50, wherein the dimeric molecule 
comprises a ligand capable of selectively binding to a 
receptor with a IC 5 o of less than 1 nM. 

56. The method of claim 50, wherein the dimeric molecule 
comprises a methotrexate moiety, FK506 moiety, an FK506 
analog, a teracycline moiety, or a cephem moiety. 

57. The method of claim 50, wherein the dimeric molecule 
comprises a methotrexate moiety. 

58. The method of claim 50, wherein the dimeric molecule 
comprises an FK506 analog. 

59. The method of claim 58, wherein the FK506 analog has the 
Structure: 

OMe 
OMe 




° °y N 



60. The method of claim 50, wherein the first fusion protein 
further comprises a DNA binding domain, and the second 
fusion protein further comprises a transcription activatio n 
domain . 
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61. The method of claim 50, wherein the first fusion protein 
further comprises a transcription activation domain, and 
the second fusion protein further comprises a DNA binding 
domain . 

62. The method of claim 60 or 61, wherein the transcription 
activation domain is a NTD. 

63. The method of claim 60 or 61 , wherein the DNA-binding 
domain is ^-cl, AraC, LexA, Gal4, or zinc fingers. 

64. The method of claim 50, wherein the first or the second 
fusion protein comprises a receptor domain of dihydrof olat e 
reductase (^DHFR"), glucocorticoid receptor, FKBP12, FKBP 
mutants, tetracycline repressor, or a penicillin binding 
protein. 

65- The method of claim 64, wherein the DHFR is the E.coll DHFR 
("eDHFR") . 

66. The method of claim 50, wherein the first fusion protein i s 
DHFR-^cI or FKBP12-^cI. 

67. The method of claim 50, wherein the second fusion protein 
is DHFR-<*NTD or FKBP12- a NTD . 

68. The method of claim 50, wherein the reporter gene is lac Z f 
araBAD, aadA, his3, P-lactamase, GFP, luciferase, TetR, 
KanR, Cm. 
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69. The method of claim 68, i n the reporter gene is Lac z\ 



70. 



A kit for identifying a molecule that binds to a known 
target in a bacterial cell from a pool of candidate 
molecules, comprising 

(a) a host bacterial cell containing a reporter gene that 
is expressed only when bound to a DNA-binding domain and 
when in the proximity of a transcription activation domain ; 

(b) a first vector containing a promoter that functions in 
the host bacterial cell and a DNA encoding a DNA-binding 
domain; 

(c) a second vector containing a promoter that functions in 
the host bacterial cell and a D NA encoding a transcription 
activation domain; 

(d) a dimeric small molecule wh ich comprises a moiety that 
binds to a known receptor domain covalently linked to a 
candidate molecule; 

(e) a means for inserting into the first vector or the 
second vector a DNA encoding the known receptor domain in 
such a manner that the known receptor domain and an 
expression product of the first or second vector are 
expressed as a fusion protein; 

(f) a means for inserting into the first vector or the 
second vector a DNA encoding the known target receptor in 
such a manner that the known target receptor and an 
expression product of the first or second vector are 
expressed as a fusion protein; and 

(g) a means for transfecting the host cell with the first 
vector, and the second vector, wherein binding of the 
dimeric small molecule to the known target receptor result s 
in a measurably greater expression of the reporter gene 
then in the absence of such binding. 
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71. A kit for identifying ai own target receptor to which 

a molecule is capable of binding in a bacterial cell, 
comprising 

(a) a host bacterial cell containing a reporter gene that 
is expressed only when bound to a DNA-binding domain and 
when in the proximity of a transcription activation domain ; 

(b) a first vector containing a promoter that functions in 
the host bacterial cell and a DNA encoding a DNA-binding 
domain; 

(c) a second vector containing a promoter that functions in 

ithe host bacterial cell and a DNA encoding a transcription 
activation domain; 

(d) a dimeric small molecule wh ich comprises a moiety that 
binds a known receptor domain covalently linked to a moiety 
against which the unknown target is to be screened for 
binding; 

(e) a means for inserting into the first vector or the 

second vector a DNA encoding the known receptor domain in 

f such a manner that the known receptor domain and an 

f * 

expression product of the first or second vector are 

f expressed as a fusion protein; 

(f ) a means for inserting into the first vector or the 
second vector a DNA encoding the unknown target receptor in 
such a manner that the unknown target receptor and an 
expression product of the first or second vector are 
expressed as a fusion protein; and 

(g) a means for transfecting the host cell with the first 
vector, and the second vector, wherein binding of the 
dimeric small molecule to the unknown target receptor 
results in a measurably greater expression of the reporter 
gene then in the absence of such binding. 

L 
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72. A process for screening .. deal library for a molecule 

that binds a known target receptor, comprising providing a 
chemical library, providing a bacterial cell that expresse s 
the known target receptor as a part of a fusion protein, 
and identifying the molecule that binds the known target 
receptor in the bacterial cell by the method of claim 30. 

A molecule identified by the method of claim 72, wherein 
the molecule was not previously known to bind the known 
target receptor. 

A process for screening a cDNA library for a nucleic acid 
that encodes a receptor to which a known molecule binds, 
comprising providing a cDNA library, providing a dimeric 
molecule having two ligands, of which one ligand is the 
known molecule, and identifying the nucleic acid that 
encodes the receptor to which the known molecule binds by 
the method of claim 50. 

An isolated nucleic acid identified by the process of claim 
14, wherein the isolated nucleic acid was not previously 
known to encode a receptor to which the known molecule 
binds. 

76. The isolated nucleic acid of claim 75, wherein the isolate d 
' nucleic acid encodes an enzyme or a portion thereof . 

77. A transgenic bacterial cell comprising 

(a) a dimeric small molecule which comprises a methotrexat e 
moiety covalently linked to a moiety capable of binding a 
receptor domain; 

(b) nucleotide sequences which upon transcription encode 



73 . 



14 



75. 
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i) a first fusion p comprising a DHFR domain an d 
a first fragment of an enzyme, and 

ii) a second fusion protein comprising the- receptor 
domain and a second fragment of the enzyme, wherein 
activity of the enzyme is conditioned on the proximit y 
of the first fragment of the enzyme to the second 
fragment of the enzyme. 

78. The bacterial cell of claim 77, wherein the moiety capable 
of binding the receptor domain is capable of binding with 
a IC 5 o of less than 100 I^M. 



79 



The bacterial cell of claim 77, wherein the moiety capable 
of binding the receptor domain is capable of binding with 
a IC50 of less than 100 nM. 



80. The bacterial cell of claim 77, wherein the moiety capable 
of binding the receptor domain is capable of binding with 
a IC50 of less than 1 nM. 



81 



A method for identifying a molecule that binds a known 
target receptor in a bacterial cell from a pool of 
candidate molecules, comprising: 

(a) forming a dimeric molecule by covalently bonding each 
molecule in the pool of candidate molecules to a 
methotrexate moiety; 

(b) introducing the dimeric mol ecule into a bacterial cell 
culture comprising bacterial cells that express a first 
fusion protein which comprises the known target receptor 
domain against which the candidate molecule is screened, 
and a first fragment of an enzyme, and a second fusion 
protein which comprises a DHFR receptor domain and a secon d 
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fragment of the enzyme; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein and to the second fusion protein, bringing 
the first fragment and the second fragment of the enzyme i n 
to proximity so as to reconstitute the activity of the 
enzyme; 

(d) selecting the bacterial cell that exhibits the activit y 
of the enzyme; and 

(e) identifying the small molecule that binds the known 
target receptor. 

82. A method for identifying an unknown target receptor to 
which a known molecule is capable of binding in a bacteria 1 
cell, comprising: 

(a) providing a dimeric molecule having a first ligand 
which has a specificity for the unknown target receptor 
covalently bonded to a methotrexate moiety; 

(b) introducing the dimeric mol ecule into a bacterial cell 
which expresses a first fusion protein which comprises the 
unknown target receptor domain, and a first fragment of an 
enzyme, and a sGcand fusion pro tein which comprises a DHFR 
receptor domain and a second fragment of the enzyme; 

(c) permitting the dimeric molecule to bind to the first 
fusion protein and to the second fusion protein, bringing 
the first fragment and the second fragment of the enzyme i n 
to proximity so as to reconstitute the activity of the 
enzyme; 

(d) selecting the bacterial cell that exhibits the activit y 
of the enzyme; and 

(e) identifying the unknown target receptor. 
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